(1) GENERAL INFORMATION:



(i) APPLICANT: THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR
UNIVERSITY



(ii) TITLE OF INVENTION: Patched Genes and their Use 



(iii) NUMBER OF SEQUENCES: 19

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Flehr, Hohbach, Test, Albritton & Herbert

(B) STREET: Four Embarcadero Center, Suite 3400 

(C) CITY: San Francisco 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 94111 



(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US95/ 

(B) FILING DATE: 06-OCT-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Rowland, Bertram I 

(B) REGISTRATION NUMBER: 20015 

(C) REFERENCE/DOCKET NUMBER: a60 190-1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-781-1989 

(B) TELEFAX: 415-398-3249 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 736 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
AACNNCNNTN NATGGCACCC CCNCCCAACC TTTNNNCCNN NTAANCAAAA NNCCCCNTTT 
NATACCCCCT NTAANAN^^^pCCACCNNNC NNAAANNCCN CTGNANACj^p^GNAAANCCN 120

TTTTTNAACC CCCCCCACCC GGAATTCCNA NTNNCCNCCC CCAAATTACA ACTCCAGNCC 180 

AAAATTNANA NAATTGGTCC TAACCTAACC NATNGTTGTT ACGGTTTCCC CCCCCAAATA 240 

CATGCACTGG CCCGAACACT TGATCGTTGC CGTTCCAATA AGAATAAATC TGGTCATATT 300 

AAACAAGCCN AAAGCTTTAC AAACTGTTGT ACAATTAATC GGCGAACACG AACTGTTCGA 360 

ATTCTGGTCT GGACATTACA AAGTGCACCA CATCGGATGG AACCAGGAGA AGGCCACAAC 420 

CGTACTGAAC GCCTGGCAGA AGAAGTTCGC ACAGGTTGGT GGTTGGCGCA AGGAGTAGAG 480 

TGAATGGTGG TAATTTTTGG TTGTTCCAGG AGGTGGATCG TCTGACGAAG AGCAAGAAGT 540 

CGTCGAATTA CATCTTCGTG ACGTTCTCCA CCGCCAATTT GAACAAGATG TTGAAGGAGG 600 

CGTCGAANAC GGACGTGGTG AAGCTGGGGG TGGTGCTGGG GGTGGCGGCG GTGTACGGGT 660

GGGTGGCCCA GTCGGGGCTG GCTGCCTTGG GAGTGCTGGT CTTNGCGNGC TNCNATTCGC 720

CCTATAGTNA GNCGTA 736

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 107 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Pro Pro Pro Asn Tyr Asn Ser Xaa Pro Lys Xaa Xaa Xaa Leu Val
1 5 10 15

Leu Thr Pro Xaa Val Val Thr Val Ser Pro Pro Lys Tyr Met His Trp
20 25 30

Pro Glu His Leu Ile Val Ala Val Pro Ile Arg Ile Asn Leu Val Ile
35 40 45

Leu Asn Lys Pro Lys Ala Leu Gln Thr Val Val Gln Leu Met Gly Glu
50 55 60

His Glu Leu Phe Glu Phe Trp Ser Gly His Tyr Lys Val His His Ile
65 70 75 80

Gly Trp Asn Gln Glu Lys Ala Thr Thr Val Leu Asn Ala Trp Gln Lys
85 90 95

Lys Phe Ala Gln Val Gly Gly Trp Arg Lys Glu
105



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGGTCTGTCA CCCGGAGCCG GAGTCCCCGG CGGCCAGCAG CGTCCTCGCG AGCCGAGCGC 
CCAGGCGCGC CCGGAGCCCG CGGCGGCGGC GGCAACATGG CCTCGGCTGG TAACGCCGCC 
GGGGCCCTGG GCAGGCAGGC CGGCGGCGGG AGGCGCAGAC GGACCGGGGG ACCGCACCGC 
GCCGCGCCGG ACCGGGACTA TCTGCACCGG CCCAGCTACT GCGACGCCGC CTTCGCTCTG 
GAGCAGATTT CCAAGGGGAA GGCTACTGGC CGGAAAGCGC CGCTGTGGCT GAGAGCGAAG 
TTTCAGAGAC TCTTATTTAA ACTGGGTTGT TACATTCAAA AGAACTGCGG CAAGTTTTTG 
GTTGTGGGTC TCCTCATATT TGGGGCCTTC GCTGTGGGAT TAAAGGCAGC TAATCTCGAG 
ACCAACGTGG AGGAGCTGTG GGTGGAAGTT GGTGGACGAG TGAGTCGAGA ATTAAATTAT 
ACCCGTCAGA AGATAGGAGA AGAGGCTATG TTTAATCCTC AACTCATGAT ACAGACTCCA 
AAAGAAGAAG GCGCTAATGT TCTGACCACA GAGGCTCTCC TGCAACACCT GGACTCAGCA 
CTCCAGGCCA GTCGTGTGCA CGTCTACATG TATAACAGGC AATGGAAGTT GGAACATTTG 
TGCTACAAAT CAGGGGAACT TATCACGGAG ACAGGTTACA TGGATCAGAT AATAGAATAC 
CTTTACCCTT GCTTAATCAT TACACCTTTG GACTGCTTCT GGGAAGGGGC AAAGCTACAG 
TCCGGGACAG CATACCTCCT AGGTAAGCCT CCTTTACGGT GGACAAACTT TGACCCCTTG 
GAATTCCTAG AAGAGTTAAA GAAAATAAAC TACCAAGTGG ACAGCTGGGA GGAAATGCTG 
AATAAAGCCG AAGTTGGCCA TGGGTACATG GACCGGCCTT GCCTCAACCC AGCCGACCCA 
GATTGCCCTG CCACAGCCCC TAACAAAAAT TCAACCAAAC CTCTTGATGT GGCCCTTGTT 
TTGAATGGTG GATGTCAAGG TTTATCCAGG AAGTATATGC ATTGGCAGGA GGAGTTGATT 
GTGGGTGGTA CCGTCAAGAA TGCCACTGGA AAACTTGTCA GCGCTCACGC CCTGCAAACC 
ATGTTCCAGT TAATGACTCC CAAGCAAATG TATGAACACT TCAGGGGCTA CGACTATGTC 
TCTCACATCA ACTGGAATGA AGACAGGGCA GCCGCCATCC TGGAGGCCTG GCAGAGGACT 



TACGTGGAGG TGGTTCA^^AAGTGTCGCC CCAAACTCCA CTCAAAA GCTTCCCTTC 1320 

ACAACCACGA CCCTGGACGA CATCCTAAAA TCCTTCTCTG ATGTCAGTGT CATCCGAGTG 1380 

GCCACCGGCT ACCTACTGAT GCTTGCCTAT GCCTGTTTAA CCATGCTGCX; CTGGGACTGC 1440 

TCCAAGTCCC AGGGTGCCGT GGGGCTGGCT GGCGTCCTGT TGGTTGCGCT GTCAGTGGCT 1500 

GCAGGATTGG GCCTCTGCTC CTTGATTGGC ATTTCTTTTA ATGCTGCGAC AACTCAGGTT 1560 

TTGCCGTTTC TTGCTCTTGG TGTTGGTGTG GATGATGTCT TCCTCCTGGC CCATGCATTC 1620 

AGTGAAACAG GACAGAATAA GAGGATTCCA TTTGAGGACA GGACTGGGGA GTGCCTCAAG 1680 

CGCACCGGAG CCAGCGTGGC CCTCACCTCC ATCAGCAATG TCACCGCCTT CTTCATGGCC 1740 

GCATTGATCC CTATCCCTGC CCTGCGAGCX; TTCTCCCTCC AGGCTGCTGT GGTGGTGGTA 1800 

TTCAATTTTG CTATGGTTCT GCTCATTTTT CCTGCAATTC TCAGCATGGA TTTATACAGA 1860 

CGTGAGGACA GAAGATTGGA TATTTTCTGC TGTTTCACAA GCCCCTGTGT CAGCAGGGTG 1920 

ATTCAAGTTG AGCCACAGGC CTACACAGAG CCTCACAGTA ACACCCGGTA CAGCCCCCCA 1980 

CCCCCATACA CCAGCCACAG CTTCGCCCAC GAAACCCATA TCACTATGCA GTCCACCGTT 2040 

CAGCTCCGCA CAGAGTATGA CCCTCACACG CACGTGTACT ACACCACCGC CGAGCCACGC 2100 

TCTGAGATCT CTGTACAGCC TGTTACCGTC ACCCAGGACA ACCTCAGCTG TCAGAGTCCC 2160 

GAGAGCACCA GCTCTACCAG GGACCTGCTC TCCCAGTTCT CAGACTCCAG CCTCCACTGC 2220 

CTCGAGCCCC CCTGCACCAA GTGGACACTC TCTTCGTTTG CAGAGAAGCA CTATGCTCCT 2280 

TTCCTCCTGA AACCCAAAGC CAAGGTTGTG GTAATCCTTC TTTTCCTGGG CTTGCTGGGG 2340 

GTCAGCCTTT ATGGGACCAC CCGAGTGAGA GACGGGCTGG ACCTCACGGA CATTGTTCCC 2400 

CGGGAAACCA GAGAATATGA CTTCATAGCT GCCCAGTTCA AGTACTTCTC TTTCTACAAC 2460 

ATGTATATAG TCACCCAGAA AGCAGACTAC CCGAATATCC AGCACCTACT TTACGACCTT 2520 

CATAAGAGTT TCAGCAATGT GAAGTATGTC ATGCTGGAGG AGAACAAGCA ACTTCCCCAA 2580 

ATGTGGCTGC ACTACTTTAG AGACTGGCTT CAAGGACTTC AGGATGCATT TGACAGTGAC 2640 

TGGGAAACTG GGAGGATCAT GCCAAACAAT TATAAAAATG GATCAGATGA CGGGGTCCTC 2700 

GCTTACAAAC TCCTGGTGCA GACTGGCAGC CGAGACAAGC CCATCGACAT TAGTCAGTTG 2760

ACTAAACAGC GTCTGGTAGA CGCAGATGGC ATCATTAATC CGAGCGCTTT CTACATCTAC 2820 

CTGACCGCTT GGGTCAGCAA CGACCCTGTA GCTTACGCTG CCTCCCAGGC CAACATCCGG 2880 

CCTCACCGGC CGGAGTGGGT CCATGACAAA GCCGACTACA TGCCAGAGAC CAGGCTGAGA 2940 

ATCCCAGCAG CAGAGCCCAT CGAGTACGCT CAGTTCCCTT TCTACCTCAA CGGCCTACGA 3000 
GACACCTCAG ACTTTGTC^^^GCCATAGAA AAAGTGAGAG TCATCTG CAACTATACG 3060

AGCCTGGGAC TGTCCAGCTA CCCCAATGGC TACCCCTTCC TGTTCTGGGA GCAATACATC 3120 
AGCCTGCGCC ACTGGCTGCT GCTATCCATC AGCGTGGTGC TGGCCTGCAC GTTTCTAGTG 3180 
TGCGCAGTCT TCCTCCTGAA CCCCTGGACG GCCGGGATCA TTGTCATGGT CCTGGCTCTG 3240 
ATGACCGTTG AGCTCTTTGG CATGATGGGC CTCATTGGGA TCAAGCTGAG TGCTGTGCCT 3300 

GTGGTCATCC TGATTGCATC TGTTGGCATC GGAGTGGAGT TCACCXJTCCA CGTGGCTTTG 3360 

GCCTTTCTGA CAGCCATTGG GGACAAGAAC CACAGGGCTA TGCTCGCTCT GGAACACATG 3420 

TTTGCTCCCG TTCTGGACGG TGCTGTGTCC ACTCTGCTGG GTGTACTGAT GCTTGCAGGG 3480 

TCCGAATTTG ATTTCATTGT CAGATACTTC TTTGCCGTCC TGGCCATTCT CACCGTCTTG 3540 

GGGGTTCTCA ATGGACTGGT TCTGCTGCCT GTCCTCTTAT CCTTCTTTGG ACCGTGTCCT 3600 

GAGGTGTCTC CAGCCAATGG CCTAAACCGA CTGCCCACTC CTTCGCCTGA GCCGCCTCCA 3660 

AGTGTCGTCC GGTTTGCCGT GCCTCCTGGT CACACGAACA ATGGGTCTGA TTCCTCCGAC 3720 

TCGGAGTACA GCTCTCAGAC CACGGTGTCT GGCATCAGTG AGGAGCTCAG GCAATACGAA 3780 

GCACAGCAGG GTGCCGGAGG CCCTGCCCAC CAAGTGATTG TGGAAGCCAC AGAAAACCCT 3840 

GTCTTTGCCC GGTCCACTGT GGTCCATCCG GACTCCAGAC ATCAGCCTCC CTTGACCCCT 3900 

CGGCAACAGC CCCACCTGGA CTCTGGCTCC TTGTCCCCTG GACGGCAAGG CCAGCAGCCT 3960 

CGAAGGGATC CCCCTAGAGA AGGCTTGCGG CCACCCCCCT ACAGACCGCG CAGAGACGCT 4020 

TTTGAAATTT CTACTGAAGG GCATTCTGGC CCTAGCAATA GGGACCGCTC AGGGCCCCGT 4080 

GGGGCCCGTT CTCACAACCC TCGGAACCCA ACGTCCACCG CCATGGGCAG CTCTGTGCCC 4140 

AGCTACTGCC AGCCCATCAC CACTGTGACG GCTTCTGCTT CGGTGACTGT TGCTGTGCAT 4200 

CCCCCGCCTG GACCTGGGCG CAACCCCCGA GGGGGGCCCT GTCCAGGCTA TGAGAGCTAC 4260 

CCTGAGACTG ATCACGGGGT ATTTGAGGAT CCTCATGTGC CTTTTCATGT CAGGTGTGAG 4320 

AGGAGGGACT CAAAGGTGGA GGTCATAGAG CTACAGGACG TGGAATGTGA GGAGAGGCCG 4380 

TGGGGGAGCA GCTCCAACTG AGGGTAATTA AAATCTGAAG CAAAGAGGCC AAAGATTGGA 4440 

AAGCCCCGCC CCCACCTCTT TCCAGAACTG CTTGAAGAGA ACTGCTTGGA ATTATGGGAA 4500 

GGCAGTTCAT TGTTACTGTA ACTGATTGTA TTATTKKGTG AAATATTTCT ATAAATATTT 4560 

AARAGGTGTA CACATGTAAT ATACATGGAA ATGCTGTACA GTCTATTTCC TGGGGCCTCT 4620 

CCACTCCTGC CCCAGAGTGG GGAGACCACA GGGGCCCTTT CCCCTGTGTA CATTGGTCTC 4680 

TGTGCCACAA CCAAGCTTAA CTTAGTTTTA AAAAAAATCT CCCAGCATAT GTCGCTGCTG 4740 
CTTAAATATT GTATAAn^^nTTGTATAAT TCTATGCAAA TATTGCTE^^TAATAGGAT 4800

TATTTGTAAA GGTTTCTGTT TAAAATATTT TAAATTTGCA TATCACAACC CTGTGGTAGG 4860 

ATGAATTGTT ACTGTTAACT TTTGAACACG CTATGCGTGG TAATTGTTTA ACGAGCAGAC 4920 

ATGAAGAAAA CAGGTTAATC CCAGTGGCTT CTCTAGGGGT AGTTGTATAT GGTTCGCATG 4980 

GGTGGATGTG TGTGTGCATG TGACTTTCCA ATGTACTGTA TTGTGGTTTG TTGTTGTTGT 5040 

TGCTGTTGTT GTTCATTTTG GTGTTTTTGG TTGCTrTGTA TGATCTTAGC TCTGGCCTAG 5100 

GTGGGCTGGG AAGGTCCAGG TCTTTTTCTG TCGTGATGCT GGTGGAAAGG TGACCCCAAT 5160 

CATCTGTCCT ATTCTCTGGG ACTATTC 5187 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Val Ala Pro Asp Ser Glu Ala Pro Ser Asn Pro Arg Ile Thr Ala
1 5 10 15

Ala His Glu Ser Pro Cys Ala Thr Glu Ala Arg His Ser Ala Asp Leu
20 25 30

Tyr Ile Arg Thr Ser Trp Val Asp Ala Ala Leu Ala Leu Ser Glu Leu
35 40 45

Glu Lys Gly Asn Ile Glu Gly Gly Arg Thr Ser Leu Trp Ile Arg Ala
50 55 60

Trp Leu Gln Glu Gln Leu Phe Ile Leu Gly Cys Phe Leu Gln Gly Asp
65 70 75 80

Ala Gly Lys Val Leu Phe Val Ala Ile Leu Val Leu Ser Thr Phe Cys
85 90 95

Val Gly Leu Lys Ser Ala Gln Ile His Thr Arg Val Asp Gln Leu Trp
100 105 110

Val Gln Glu Gly Gly Arg Leu Glu Ala Glu Leu Lys Tyr Thr Ala Gln
115 120 125

Ala Leu Gly Glu Ala Asp Ser Ser Thr His Gln Leu Val Ile Gln Thr
130 135 140
Ala Lys Asp P^^sp Val Ser Leu Leu His Pro Gl a Leu Leu Glu

150 155 160 

His Leu Lys Val Val His Ala Ala Thr Arg Val Thr Val His Met Tyr
165 170 175

Asp Ile Glu Trp Arg Leu Lys Asp Leu Cys Tyr Ser Pro Ser Ile Pro
180 185 190

Asp Phe Glu Gly Tyr His His Ile Glu Ser Ile Ile Asp Asn Val Ile
195 200 205

Pro Cys Ala Ile Ile Thr Pro Leu Asp Cys Phe Trp Glu Gly Ser Lys
210 215 220

Leu Leu Gly Pro Asp Tyr Pro Ile Tyr Val Pro His Leu Lys His Lys
225 230 235 240

Leu Gln Trp Thr His Leu Asn Pro Leu Glu Val Val Glu Glu Val Lys
245 250 255

Lys Leu Lys Phe Gln Phe Pro Leu Ser Thr Ile Glu Ala Tyr Met Lys
260 265 270

Arg Ala Gly Ile Thr Ser Ala Tyr Met Lys Lys Pro Cys Leu Asp Pro
275 280 285

Thr Asp Pro His Cys Pro Ala Thr Ala Pro Asn Lys Lys Ser Gly His
290 295 300

Ile Pro Asp Val Ala Ala Glu Leu Ser His Gly Cys Tyr Gly Phe Ala
310 315 320

Ala Ala Tyr Met His Trp Pro Glu Gln Leu Ile Val Gly Gly Ala Thr
325 330 335

Arg Asn Ser Thr Ser Ala Leu Arg Lys Ala Arg Xaa Leu Gln Thr Val
340 345 350

Val Gln Leu Met Gly Glu Arg Glu Met Tyr Glu Tyr Trp Ala Asp His
355 360 365

Tyr Lys Val His Gln Ile Gly Trp Asn Gln Glu Lys Ala Ala Ala Val
370 375 380

Leu Asp Ala Trp Gln Arg Lys Phe Ala Ala Glu Val Arg Lys Ile Thr
385 390 395 400

Thr Ser Gly Ser Val Ser Ser Ala Tyr Ser Phe Tyr Pro Phe Ser Thr
405 410 415

Ser Thr Leu Asn Asp Ile Leu Gly Lys Phe Ser Glu Val Ser Leu Lys
420 425 430

Asn Ile Ile Leu Gly Tyr Met Phe Met Leu Ile Tyr Val Ala Val Thr
435 440 445

Leu Ile Gln Trp Arg Asp Pro II Arg Ser Gln Ala Gly Val Gly Ile
450 455 460



Ala Gly Val Leu Leu Leu Ser Ile Thr Val Ala Ala Gly Leu Gly Phe
470 475

Cys Ala Leu Leu Gly Ile Pro Phe Asn Ala Ser Ser Thr Gln Ile Val
485 490

Pro Phe Leu Ala Leu Gly Leu Gly Val Gln Asp Met Phe Leu Leu Thr
500 505 510

His Thr Tyr Val Glu Gln Ala Gly Asp Val Pro Arg Glu Glu Arg Thr
515 520 525

Gly Leu Val Leu Lys Lys Ser Gly Leu Ser Val Leu Leu Ala Ser Leu
530 535



Cys Asn Val Met Ala Phe Leu Ala Ala Ala Leu Leu Pro Ile Pro



545 



550 555 



Ala 
560 



Leu 



Phe Arg Val Phe Cys Leu Gln Ala Ala Ile Leu Leu Leu Phe Asn
565 570

Gly Ser Ile Leu Leu Val Phe Pro Ala Met Ile Ser Leu Asp Leu Arg
585 590

Arg Arg Ser Ala Ala Arg Ala Asp Leu Leu Cys Cys Leu Met Pro Glu
595 600 605

^^'^ «Tn '^^^ Ala Lys Thr Arg Lys 

"° 515 620 

Asn Asp Lys Thr His Arg Ile Asp Thr Thr Arg Gln Pro Leu Asp Pro
630 635 640

Asp Val Ser Glu Asn Val Thr Lys Thr Cys Cys Leu Ser Val Ser Leu
645 650 655

Thr Lys Trp Ala Lys Asn Gln Tyr Ala Pro Phe Ile Met Arg Pro Ala
660 665 670

Val Lys Val Thr Ser Met Leu Ala Leu Ile Ala Val Ile Leu Thr Ser
675 680 685

Val Trp Gly Ala Thr Lys Val Lys Asp Gly Leu Asp Leu Thr Asp Ile
690 695 700



Val Pro Glu Asn Thr Asp Glu His Glu Phe Leu Ser Arg Gln Glu Lys
705 710 715 720

Tyr Phe Gly Phe Tyr Asn Met Tyr Ala Val Thr Gln Gly Asn Phe Glu
725 730

Tyr Pro Thr Asn Gln Lys Leu Leu Tyr Glu Tyr His Asp Gln Phe Val
740 745

Arg Ile Pro Asn Ile Ile Lys Asn Asp Asn Gly Gly Leu Thr Lys Phe
755 760 765
Trp Leu Ser ^^^he Arg Asp Trp Leu Leu Asp L^^n Val Ala Phe
770 775



Asp Lys Glu Val Ala Ser Gly Cys II Thr Gln Glu Tyr Trp Cys Lys
790 795 800

Asn Ala Ser Asp Glu Gly Ile Leu Ala Tyr Lys Leu Met Val Gln Thr
805 810 815

Gly His Val Asp Asn Pro Ile Asp Lys Ser Leu Ile Thr Ala Gly His
820 825 830

Arg Leu Val Asp Lys Asp Gly Ile Ile Asn Pro Lys Ala Phe Tyr Asn
835 840 845

Tyr Leu Ser Ala Trp Ala Thr Asn Asp Ala Leu Ala Tyr Gly Ala Ser
850 855 860

Gln Gly Asn Leu Lys Pro Gln Pro Gln Arg Trp Ile His Ser Pro Glu
870 875 880

Asp Val His Leu Glu Ile Lys Lys Ser Ser Pro Leu Ile Tyr Thr Gln
885 890 895

Leu Pro Phe Tyr Leu Ser Gly Leu Ser Asp Thr Xaa Ser Ile Lys Thr
900 905 910

Leu Ile Arg Ser Val Arg Asp Leu Cys Leu Lys Tyr Glu Ala Lys Gly
915 920 925

Leu Pro Asn Phe Pro Ser Gly Ile Pro Phe Leu Phe Trp Glu Gln Tyr
930 935

Leu Tyr Leu Arg Thr Ser Leu Leu Leu Ala Leu Ala Cys Ala Leu Ala 

950 955 960 

Ala Val Phe Ile Ala Val Met Val Leu Leu Leu Asn Ala Trp Ala Ala
965 970 975

Val Leu Val Thr Leu Ala Leu Ala Thr Leu Val Leu Gln Leu Leu Gly
980 985 990

Val Met Ala Leu Leu Gly Val Lys Leu Ser Ala Met Pro Ala Val Leu
995 1000 1005

Leu Val Leu Ala Ile Gly Arg Gly Val His Phe Thr Val His Leu Cys
1010 1015 1020

Leu Gly Phe Val Thr Ser Ile Gly Cys Lys Arg Arg Arg Ala Ser Leu
1025 1030 1035 1040

Ala Leu Glu Ser Val Leu Ala Pro Val Val His Gly Ala Leu Ala Ala
1045 1050 1055

Ala Leu Ala Ala Ser Met Leu Ala Ala Ser Glu Cys Gly Phe Val Ala
1060 1065 1070

Arg Leu Phe Leu Arg Leu Leu Leu Asp Ile Val Phe Leu Gly Leu Ile
1075 1080 1085



JiL''"" '"''^ Ala 

1095 1100 

Ala Glu Val Arg Pro Ile Glu His Pro Glu Arg Leu Ser Thr Pro Ser

Pro Lys Cys Ser Pro Ile His Pro Arg Lys Ser Ser Ser Ser Ser Gly
1125 1130 1135

Gly Gly Asp Lys Ser Ser Arg Thr Ser Lys Ser Ala Pro Arg Pro Cys
1140 1145 1150

Ala Pro Ser Leu Thr Thr Ile Thr Glu Glu Pro Ser Ser Trp His Ser
1155 1160 1165

Ser Ala His Ser Val Gln Ser Ser Met Gln Ser Ile Val Val Gln Pro

1175 1180 



U1 h'ss'''' TJfo^^" ^'^^ ^1- 

i 1190 1195 1200 



Gly Arg Ser Thr Pro Thr Lys Ser Ser His Gly Gly Ala Ile Thr Thr
1205 1210 1215

Thr Lys Val Thr Ala Thr Ala Asn Ile Lys Val Glu Val Val



1220 1225 



Thr Pro 
1230 



Ser Asp Arg Lys Ser Arg Arg Ser Tyr His Tyr Tyr Asp Arg Arg Arg
1235 1240 1245

S "^^^ ilV"^ """^ ""^^ ^'^^ ""^^ ^5 Asp Arg ASP Arg 

1255 1260 

Asp Arg Asp Arg Asp Arg Asp Arg Asp Arg Asp Arg Asp Arg Asp Arg 

1275 1280 

Glu Arg Ser Arg Glu Arg Asp Arg Arg Asp Arg Tyr Arg Asp Glu Arg 
1285 1290 1295 

Asp His Arg Ala Ser Pro Arg Glu Lys Arg Gln Arg Phe Trp Thr
1300 1305 1310 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4434 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGAAACAAGA GAGCGAGTGA TAGGGAG AGCGTCTGTG TTGTGTGTTG ^^STCGCCC 60

ACGCACACAG GCGCAAAACA GTGCACACAG ACGCCCGCTG GGCAAGAGAG AGTGAGAGAG 120 

AGAAACAGCG GCGCGCGCTC GCCTAATGAA GTTGTTGGCC TGGCTGGCGT GCCGCATCCA 180 

CGAGATACAG ATACATCTCT CATGGACCGC GACAGCCTCC CACGCGTTCC GGACACACAC 240 

GGCGATGTGG TCGATGAGAA ATTATTCTCG GATCTTTACA TACGCACCAG CTGGGTGGAC 300 

GCCCAAGTGG CGCTCGATCA GATAGATAAG GGCAAAGCGC GTGGCAGCCG CACGGCGATC 360 

TATCTGCGAT CAGTATTCCA GTCCCACCTC GAAACCCTCG GCAGCTCCGT GCAAAAGCAC 420 

GCGGGCAAGG TGCTATTCGT GGCTATCCTG GTGCTGAGCA CCTTCTGCGT CGGCCTGAAG 480 

AGCGCCCAGA TCCACTCCAA GGTGCACCAG CTGTGGATCC AGGAGGGCGG CCGGCTGGAG 540

GCGGAACTGG CCTACACACA GAAGACXSATC GGCGAGGACG AGTCGGCCAC GCATCAGCTG 600

CTCATTCAGA CGACCCACGA CCCGAACGCC TCCGTCCTGC ATCCGCAGGC GCTGCTTGCC 660

CACCTGGAGG TCCTGGTCAA GGCCACCGCC GTCAAGGTGC ACCTCTACGA CACCGAATGG 720

GGGCTGCGCG ACATGTGCAA CATGCCGAGC ACGCCCTCCT TCGAGGGCAT CTACTACATC 780

GAGCAGATCC TGCGCCACCT CATTCCGTGC TCGATCATCA CGCCGCTGGA CTGTTTCTGG 840

GAGGGAAGCC AGCTGTTGGG TCCGGAATCA GCGGTCGTTA TACCAGGCCT CAACCAACGA 900

CTCCTGTGGA CCACCCTGAA TCCCGCCTCT GTGATGCAGT ATATGAAACA AAAGATGTCC 960 

GAGGAAAAGA TCAGCTTCGA CTTCGAGACC GTGGAGCAGT ACATGAAGCG TGCGGCCATT 1020 

GGCAGTGGCT ACATGGAGAA GCCCTGCCTG AACCCACTGA ATCCCAATTG CCCGGACACG 1080 

GCACCGAACA AGAACAGCAC CCAGCCGCCG GATGTGGGAG CCATCCTGTC CGGAGGCTGC 1140 

TACGGTTATG CCGCGAAGCA CATGCACTGG CCGGAGGAGC TGATTGTGGG CGGACGGAAG 1200 

AGGAACCGCA GCGGACACTT GAGGAAGGCC CAGGCCCTGC AGTCGGTGGT GCAGCTGATG 1260 

ACCGAGAAGG AAATGTACGA CCAGTGGCAG GACAACTACA AGGTGCACCA TCTTGGATGG 1320 

ACGCAGGAGA AGGCAGCGGA GGTTTTGAAC GCCTGGCAGC GCAACTTTTC GCGGGAGGTG 1380 

GAACAGCTGC TACGTAAACA GTCGAGAATT GCCACCAACT ACGATATCTA CGTGTTCAGC 1440 

TCGGCTGCAC TGGATGACAT CCTGGCCAAG TTCTCCCATC CCAGCGCCTT GTCCATTGTC 1500 

ATCGGCGTGG CCGTCACCGT TTTGTATGCC TTTTGCACGC fcCTCCGCTG GAGGGACCCC 1560 

GTCCGTGGCC AGAGCAGTGT GGGCGTGGCC GGAGTTCTGC TCATGTGCTT CAGTACCGCC 1620 

GCCGGATTGG GATTGTCAGC CCTGCTCGGT ATCGTTTTCA ATGCGCTGAC CGCTGCCTAT 1680 

GCGGAGAGCA ATCGGCGGGA GCAGACCAAG CTGATTCTCA AGAACGCCAG CACCCAGGTG 1740 






GTTCCGTTTT TGGCCCa|||^ TCTGGGCGTC GATCACATCT TCATAGMK ACCGAGCATC 1800

CTGTTCAGTG CCTGCAGCAC CGCAGGATCC TTCTTTGCGG CCGCCTTSfx TCCGGTGCCG 1860

GCTTTGAAGG TATTCTGTCT GCAGGCTGCC ATCGTAATGT GCTCCAATTT GGCAGCGGCT 1920

CTATTGGTTT TTCCGGCCAT GATTTCGTTG GATCTACGGA GACGTACCGC CGGCAGGGCG 1980

GACATCTTCT GCTGCTGTTT TCCGGTGTGG AAGGAACAGC CGAAGGTGGC ACCTCCGGTG 2040

CTGCCGCTGA ACAACAACAA CGGGCGCGGG GCCCGGCATC CGAAGAGCTG CAACAACAAC 2100

AGGGTGCCGC TGCCCGCCCA GAATCCTCTG CTGGAACAGA GGGCAGACAT CCCTGGGAGC 2160

AGTCACTCAC TGGCGTCCTT CTCCCTGGCA ACCTTCGCCT TTCAGCACTA CACTCCCTTC 2220

CTCATGCGCA GCTGGGTGAA GTTCCTGACC GTTATGGGTT TCCTGGCGGC CCTCATATCC 2280

AGCTTGTATG CCTCCACGCG CCTTCAGGAT GGCCTGGACA TTATTGATCT GGTGCCCAAG 2340

GACAGCAACG AGCACAAGTT CCTGGATGCT CAAACTCGGC TCTTTGGCTT CTACAGCATG 2400

TATGCGGTTA CCCAGGGCAA CTTTGAATAT CCCACCCAGC AGCAGTTGCT CAGGGACTAC 2460

CATGATTCCT TTGTGCGGGT GCCACATGTG ATCAAGAATG ATAACGGTGG ACTGCCGGAC 2520

TTCTGGCTGC TGCTCTTCAG CGAGTGGCTG GGTAATCTGC AAAAGATATT CGACGAGGAA 2580

TACCGCGACG GACGGCTGAC CAAGGAGTGC TGGTTCCCAA ACGCCAGCAG CGATGCCATC 2640

CTGGCCTACA AGCTAATCGT GCAAACCGGC CATGTGGACA ACCCCGTGGA CAAGGAACTG 2700

GTGCTCACCA ATCGCCTGGT CAACAGCGAT GGCATCATCA ACCAACGCGC CTTCTACAAC 2760

TATCTGTCGG CATGGGCCAC CAACGACGTC TTCGCCTACG GAGCTTCTCA GGGCAAATTG 2820

TATCCGGAAC CGCGCCAGTA TTTTCACCAA CCCAACGAGT ACGATCTTAA GATACCCAAG 2880

AGTCTGCCAT TGGTCTACGC TCAGATGCCC TTTTACCTCC ACGGACTAAC AGATACCTCG 2940

CAGATCAAGA CCCTGATAGG TCATATTCGC GACCTGAGCG TCAAGTACGA GGGCTTCGGC 3000

CTGCCCAACT ATCCATCGGG CATTCCCTTC ATCTTCTGGG AGCAGTACAT GACCCTGCGC 3060

TCCTCACTGG CCATGATCCT GGCCTGCGTG CTACTCGCCG CCCTGGTGCT GGTCTCCCTG 3120

CTCCTGCTCT CCGTTTGGGC CGCCGTTCTC GTGATCCTCA GCGTTCTGGC CTCGCTGGCC 3180

CAGATCTTTG GGGCCATGAC TCTGCTGGGC ATCAAACTCT CGGCCATTCC GGCAGTCATA 3240

CTCATCCTCA GCGTGGGCAT GATGCTGTGC TTCAATGTGC TGATATCACT GGGCTTCATG 3300

ACATCCGTTG GCAACCGACA GCGCCGCGTC CAGCTGAGCA TGCAGATGTC CCTGGGACCA 3360

CTTGTCCACG GCATGCTGAC CTCCGGAGTG GCCGTGTTCA TGCTCTCCAC GTCGCCCTTT 3420

GAGTTTGTGA TCCGGCACTT CTGCTGGCTT CTGCTGGTGG TCTTATGCGT TGGCGCCTGC 3480

AACAGCCTTT TGGTGT^J^ CATCCTACTG AGCATGGTGG GACCG^KC GGAGCTGGTG 3540

CCGCTGGAGC ATCCAGACCG CATATCCACX; CCCTCTCCGC TGCCCGTGCG CAGCAGCAAG 3600 

AGATCGGGCA AATCCTATGT GGTGCAGGGA TCGCGATCCT CGCGAGGCAG CTGCCAGAAG 3660 

TCGCATCACC ACCACCACAA AGACCTTAAT GATCCATCGC TGACGACGAT CACCGAGGAG 3720 

CCGCAGTCGT GGAAGTCCAG CAACTCGTCC ATCCAGATGC CCAATGATTG GACCTACCAG 3780 

CCGCGGGAAC AGCGACCCGC CTCCTACGCG GCCCCGCCCC CCGCCTATCA CAAGGCCGCC 3840 

GCCCAGCAGC ACCACCAGCA TCAGGGCCCG CCCACAACXSC CCCCGCCTCC CTTCCCGACG 3900 

GCCTATCCGC CGGAGCTGCA GAGCATCGTG GTGCAGCCGG AGGTGACGGT GGAGACGACG 3960 

CACTCGGACA GCAACACCAC CAAGGTGACXS GCCACGGCCA ACATCAAGGT GGAGCTGGCC 4020 

ATGCCCGGCA GGGCGGTGCG CAGCTATAAC TTTACGAGTT AGCACTAGCA CTAGTTCCTG 4080 

TAGCTATTAG GACGTATCTT TAGACTCTAG CCTAAGCCGT AACCCTATTT GTATCTGTAA 4140 

AATCGATTTG TCCAGCGGGT CTGCTGAGGA TTTCGTTCTC ATGGATTCTC ATGGATTCTC 4200 

ATGGATGCTT AAATGGCATG GTAATTGGCA AAATATCAAT TTTTGTGTCT CAAAAAGATG 4260

CATTAGCTTA TGGTTTCAAG ATACATTTTT AAAGAGTCCG CCAGATATTT ATATAAAAAA 4320 

AATCCAAAAT CGACGTATCC ATGAAAATTG AAAAGCTAAG CAGACCCGTA TGTATGTATA 4380 

TGTGTATGCA TGTTAGTTAA TTTCCCGAAG TCCGGTATTT ATAGCAGCTG CCTT 4434 
(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1285 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asp Arg Asp Ser Leu Pro Arg Val Pro Asp Thr His Gly Asp Val
1 5 10 15

Val Asp Glu Lys Leu Phe Ser Asp Leu Tyr Ile Arg Thr Ser Trp Val
20 25 30

Asp Ala Gln Val Ala Leu Asp Gln Ile Asp Lys Gly Lys Ala Arg Gly
35 40 45

Ser Arg Thr Ala Ile Tyr Leu Arg Ser Val Phe Gln Ser His Leu Glu
50 55 60
Thr Leu Gly Ser Val Gln Lys His Ala Gly 3^^al Leu Phe Val
65 70 75 80

Ala Ile Leu Val Leu Ser Thr Phe Cys Val Gly Leu Lys Ser Ala Gln
85 90 95

Ile His Ser Lys Val His Gln Leu Trp Ile Gln Glu Gly Gly Arg Leu
100 105 110

Glu Ala Glu Leu Ala Tyr Thr Gln Lys Thr Ile Gly Glu Asp Glu Ser
115 120 125

Ala Thr His Gln Leu Leu Ile Gln Thr Thr His Asp Pro Asn Ala Ser
130 135 140

Val Leu His Pro Gln Ala Leu Leu Ala His Leu Glu Val Leu Val Lys
150 155 160

Ala Thr Ala Val Lys Val His Leu Tyr Asp Thr Glu Trp Gly Leu Arg
165 170 175

Asp Met Cys Asn Met Pro Ser Thr Pro Ser Phe Glu Gly Ile Tyr Tyr
180 185 190

Ile Glu Gln Ile Leu Arg His Leu Ile Pro Cys Ser Ile Ile Thr Pro
195 200 205

Leu Asp Cys Phe Trp Glu Gly Ser Gln Leu Leu Gly Pro Glu Ser Ala
210 215 220

Val Val Ile Pro Gly Leu Asn Gln Arg Leu Leu Trp Thr Thr Leu Asn
225 230 235 240

Pro Ala Ser Val Met Gln Tyr Met Lys Gln Lys Met Ser Glu Glu Lys
245 250 255

Ile Ser Phe Asp Phe Glu Thr Val Glu Gln Tyr Met Lys Arg Ala Ala
260 265 270

Ile Gly Ser Gly Tyr Met Glu Lys Pro Cys Leu Asn Pro Leu Asn Pro
275 280 285

Asn Cys Pro Asp Thr Ala Pro Asn Lys Asn Ser Thr Gln Pro Pro Asp
290 295 300

Val Gly Ala Ile Leu Ser Gly Gly Cys Tyr Gly Tyr Ala Ala Lys His
310 315 320

Met His Trp Pro Glu Glu Leu Ile Val Gly Gly Arg Lys Arg Asn Arg
325 330 335

Ser Gly His Leu Arg Lys Ala Gln Ala Leu Gln Ser Val Val Gln Leu
340 345 350

Met Thr Glu Lys Glu Met Tyr Asp Gln Trp Gln Asp Asn Tyr Lys Val
355 360 365



His His Leu Gly Trp Thr Gln Glu Lys Ala Ala Glu Val Leu Asn Ala
370 375



Trp Gln Arg Asn Phe Ser Arg Glu Val Glu Gln Leu Leu Arg Lys Gln
390 395

Ser Arg Ile Ala Thr Asn Tyr Asp Ile Tyr Val Phe Ser Ser Ala Ala
405 410 415

Leu Asp Asp Ile Leu Ala Lys Phe Ser His Pro Ser Ala Leu Ser Ile
420 425 430

Val Ile Gly Val Ala Val Thr Val Leu Tyr Ala Phe Cys Thr Leu Leu
435 440 445

Arg Trp Arg Asp Pro Val Arg Gly Gln Ser Ser Val Gly Val Ala Gly
450 455 460

Val Leu Leu Met Cys Phe Ser Thr Ala Ala Gly Leu Gly Leu Ser Ala
470 475 480

Leu Leu Gly Ile Val Phe Asn Ala Leu Thr Ala Ala Tyr Ala Glu Ser
485 490 495

Asn Arg Arg Glu Gln Thr Lys Leu Ile Leu Lys Asn Ala Ser Thr Gln
500 505 510

Val Val Pro Phe Leu Ala Leu Gly Leu Gly Val Asp His Ile Phe Ile
515 520 525

Val Gly Pro Ser Ile Leu Phe Ser Ala Cys Ser Thr Ala Gly Ser Phe
530 535

Phe Ala Ala Ala Phe Ile Pro Val Pro Ala Leu Lys Val Phe Cys Leu
545 550 555 560

Gln Ala Ala Ile Val Met Cys Ser Asn Leu Ala Ala Ala Leu Leu Val
565 570

Phe Pro Ala Met Ile Ser Leu Asp Leu Arg Arg Arg Thr Ala Gly Arg
580 585 590

Ala Asp Ile Phe Cys Cys Cys Phe Pro Val Trp Lys Glu Gln Pro Lys
595 600 605

Val Ala Pro Pro Val Leu Pro Leu Asn Asn Asn Asn Gly Arg Gly Ala
610 615 620

Arg His Pro Lys Ser Cys Asn Asn Asn Arg Val Pro Leu Pro Ala Gln
625 630 635 640

Asn Pro Leu Leu Glu Gln Arg Ala Asp Ile Pro Gly Ser Ser His Ser
645 650 655

Leu Ala Ser Phe Ser Leu Ala Thr Phe Ala Phe Gln His Tyr Thr Pro
660 665 670

Phe Leu Met Arg Ser Trp Val Lys Phe Leu Thr Val Met Gly Phe Leu
675 680 685
Ala Ala Leu Z^er Ser Leu Tyr Ala Ser Thr A^^u Gln Asp Gly

Leu Asp Ile Ile Asp Leu Val Pro Lys Asp Ser Asn Glu His Lys Phe
710 715

Leu Asp Ala Gln Thr Arg Leu Phe Gly Phe Tyr Ser Met Tyr Ala Val
725 730

Thr Gln Gly Asn Phe Glu Tyr Pro Thr Gln Gln Gln Leu Leu Arg Asp
745 750

Tyr His Asp Ser Phe Arg Val Pro His Val Ile Lys Asn Asp Asn Gly
755 760 765

Gly Leu Pro Asp Phe Trp Leu Leu Leu Phe Ser Glu Trp Leu Gly Asn
775 780

Leu Gln Lys Ile Phe Asp Glu Glu Tyr Arg Asp Gly Arg Leu Thr Lys
795 800

Glu Cys Trp Phe Pro Asn Ala Ser Ser Asp Ala Ile Leu Ala Tyr Lys
805 810 815

Leu Ile Val Gln Thr Gly His Val Asp Asn Pro Val Asp Lys Glu Leu
820 825 830



Val Leu Thr Asn Arg Leu Val Asn Ser Asp Gly Ile Ile Asn Gln Arg
835 840 845

Ala Phe Tyr Asn Tyr Leu Ser Ala Trp Ala Thr Asn Asp Val Phe Ala
855 860

Tyr Gly Ala Ser Gln Gly Lys Leu Tyr Pro Glu Pro Arg Gln Tyr Phe
875 880

His Gln Pro Asn Glu Tyr Asp Leu Lys Ile Pro Lys Ser Leu Pro Leu
885 890 895

Val Tyr Ala Gln Met Pro Phe Tyr Leu His Gly Leu Thr Asp Thr Ser
900

Gln Ile Lys Thr Leu Ile Gly His Ile Arg Asp Leu Ser Val Lys Tyr
920 925

Glu Gly Phe Gly Leu Pro Asn Tyr Pro Ser Gly Ile Pro Phe Ile Phe
930 935

Trp Glu Gln Tyr Met Thr Leu Arg Ser Ser Leu Ala Met Ile Leu Ala
955 960

Cys Val Leu Leu Ala Ala Leu Val Leu Val Ser Leu Leu Leu Leu Ser
965 970 975

Val Trp Ala Ala Val Leu Val Ile Leu Ser Val Leu Ala Ser Leu Ala
980 985 990

Gln Ile Phe Gly Ala Met Thr Leu Leu Gly Ile Lys Leu Ser Ala Ile
Pro »l.^vax XI. .eu xle ^„ s.. v.l cl, H . «e. ^„ cy. P.e .3„

-^-"-^^ 1020 
v.l^I.eu n. ser «^^p,e „^ ^„ 

1040 



V.X ^„ 

1055 

Het .eu xn. s,.^ox>, v,x v,x .He h« ..u s., THr Se, p.. 

1070 

OXU v.x^xx, ^, Hi, P., ^„ ^„ ^„ 

-^"^O 1085 

Sjo"* *•» ^' JJ" — V.1 p.. pro XX, u.„ x..„ s„ He. 

1100 

vex^=x, P„ oxu «. c.u^xeu VeX Pro Xe„ oXu^„x, p„ 

r.. P.O ser P.o^.,„ p„ j;;^^ ^^^^JJJ 

... T., vex vex „x„ cx, s.r s« se. ox. c„ 

XX50 

„X. HXe^Hl. HXe HXe « ^ 

1165 

He Thr Glu Glu Pro Gin Ser Trp Lys Ser Se. a . 

laVO P Ser Asn Ser Ser lie Gin 

^■^'^ 1180 

?XSS-" ■'■^ - A., OXU 0X„ Ar, P.o se. 

X200 

ryr «e AXe Pro P„ p„ ,xe Ty. „x. «e M. OX„ OX„ HXs 

1210 j^215 
His Gin His Gin Gly Pro Pro Th*- « ' 

1220 Pro Phe Pro Thr 

AXe P.,^p.o OX„ .e„ OX„ Se. XL Vex VeX OX„ P„ OXu V.X THr 

-"■^^^ 1245 



vex oXu^Th. Thr Hi. s« Aep Ser Ae„ T.r Thr .y, v,x Thr AXe Thr 

-^^^^ 1260 
Ala Asn Ile Lys Val Glu Leu Ala Met Pro Gly Arg Ala Val
1270 1275 1280

Tyr Asn Phe Thr Ser
1285

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 345 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

AAGGTCCATC AGCTTTGGAT ACAGGAAGGT GGTTCGCTCG AGCATGAGCT AGCCTACACG 60

CAGAAATCGC TCGGCGAGAT GGACTCCTCC ACGCACCAGC TGCTAATCCA AACNCCCAAA 120

GATATGGACG CCTCGATACT GCACCCGAAC GCGCTACTGA CGCACCTGGA CGTGGTGAAG 180

AAAGCGATCT CGGTGACGGT GCACATGTAC GACATCACGT GGAGNCTCAA GGACATGTGC 240

TACTCGCCCA GCATACCGAG NTTCGATACG CACTTTATCG AGCAGATCTT CGAGAACATC 300

ATACCGTGCG CGATCATCAC GCCGCTGGAT TGCTTTTGGG AGGGA 345



(2) INFORMATION FOR SEQ ID NO: 8: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

Lys Val His Gln Leu Trp Ile Gln Glu Gly Gly Ser Leu Glu His Glu
5 10 15

Leu Ala Tyr Thr Gln Lys Ser Leu Gly Glu Met Asp Ser Ser Thr His
20 25 30

Gln Leu Leu Ile Gln Thr Pro Lys Asp Met Asp Ala Ser Ile Leu His
35 40 45

Pro Asn Ala Leu Leu Thr His Leu Asp Val Val Lys Lys Ala Ile Ser
50 55 60

Val Thr Val His Met Tyr Asp Ile Thr Trp Xaa Leu Lys Asp Met Cys
65 70 75 80

Tyr Ser Pro Ser Ile Pro Xaa Phe Asp Thr His Phe Ile Glu Gln Ile
85 90 95

Phe Glu Asn Ile Pro Cys Ala Ile Ile Thr ^^fLeu Asp Cys Phe
100 105 110



Trp Glu Gly 
115 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

GGGTCTGTCA CCCGGAGCCG GAGTCCCCGG CGGCCAGCAG CGTCCTCGCG AGCCGAGCGC 60

CCAGGCGCGC CCGGAGCCCG CGGCGGCGGC GGCAACATGG CCTCGGCTGG TAACGCCGCC 120

GGGGCCCTGG GCAGGCAGGC CGGCGGCGGG AGGCGCAGAC GGACCGGGGG ACCGCACCGC 180

GCCGCGCCGG ACCGGGACTA TCTGCACCGG CCCAGCTACT GCGACGCCGC CTTCGCTCTG 240

GAGCAGATTT CCAAGGGGAA GGCTACTGGC CGGAAAGCGC CGCTGTGGCT GAGAGCGAAG 300

TTTCAGAGAC TCTTATTTAA ACTGGGTTGT TACATTCAAA AGAACTGCGG CAAGTTTTTG 360

GTTGTGGGTC TCCTCATATT TGGGGCCTTC GCTGTGGGAT TAAAGGCAGC TAATCTCGAG 420

ACCAACGTGG AGGAGCTGTG GGTGGAAGTT GGTGGACGAG TGAGTCGAGA ATTAAATTAT 480

ACCCGTCAGA AGATAGGAGA AGAGGCTATG TTTAATCCTC AACTCATGAT ACAGACTCCA 540

AAAGAAGAAG GCGCTAATGT TCTGACCACA GAGGCTCTCC TGCAACACCT GGACTCAGCA 600

CTCCAGGCCA GTCGTGTGCA CGTCTACATG TATAACAGGC AATGGAAGTT GGAACATTTG 660

TGCTACAAAT CAGGGGAACT TATCACGGAG ACAGGTTACA TGGATCAGAT AATAGAATAC 720

CTTTACCCTT GCTTAATCAT TACACCTTTG GACTGCTTCT GGGAAGGGGC AAAGCTACAG 780

TCCGGGACAG CATACCTCCT AGGTAAGCCT CCTTTACGGT GGACAAACTT TGACCCCTTG 840

GAATTCCTAG AAGAGTTAAA GAAAATAAAC TACCAAGTGG ACAGCTGGGA GGAAATGCTG 900

AATAAAGCCG AAGTTGGCCA TGGGTACATG GACCGGCCTT GCCTCAACCC AGCCGACCCA 960

GATTGCCCTG CCACAGCCCC TAACAAAAAT TCAACCAAAC CTCTTGATGT GGCCCTTGTT 1020

TTGAATGGTG GATGTCAAGG TTTATCCAGG AAGTATATGC ATTGGCAGGA GGAGTTGATT 1080

GTGGGTGGTA CCGTCAAGAA TGCCACTGGA AAACTTGTCA GCGCTCACGC CCTGCAAACC 1140

ATGTTCCAGT TAATGA CAAGCAAATG TATGAACACT TCAGGG^P CGACTATGTC 1200

TCTCACATCA ACTGGAATGA AGACAGGGCA GCCXKICATCC TGGAGGCCTG GCAGAGGACT 1260 

TACGTGGAGG TGGTTCATCA AAGTGTCGCC CCAAACTCCA CTCAAAAGGT GCTTCCCTTC 1320 

ACAACCACGA CCCTGGACCA CATCCTAAAA TCCTTCTCTG ATGTCAGTGT CATCCGAGTG 1380 

GCCAGCGGCT ACCTACTGAT GCTTGCCTAT GCCTGTTTAA CCATGCTGCG CTGGGACTGC 1440 

TCCAAGTCCC AGGGTGCCCT GGGGCTGGCT GGCGTCCTGT TGGTTGCGCT GTCAGTGGCT 1500 

GCAGGATTGG GCCTCTGCTC CTTGATTGGC ATTTCTTTTA ATGCTGCGAC AACTCAGGTT 1560 

TTGCCGTTTC TTGCTCTTGG TGTTGGTGTG GATGATGTCT TCCTCCTGGC CCATGCATTC 1620 

AGTGAAACAG GACAGAATAA GAGGATTCCA TTTGAGGACA GGACTGGGGA GTGCCTCAAG 1680 

CGCACCGGAG CCAGCGTGGC CCTCACCTCC ATCAGCAATG TCACCGCCTT CTTCATGGCC 1740 

GCATTGATCC CTATCCCTGC CCTGCGAGCG TTCTCCCTCC AGGCTGCTGT GGTGGTGGTA 1800 

TTCAATTTTG CTATGGTTCT GCTCATTTTT CCTGCAATTC TCAGCATGGA TTTATACAGA 1860 

CGTGAGGACA GAAGATTGGA TATTTTCTGC TGTTTCACAA GCCCCTGTGT CAGCAGGGTG 1920 

ATTCAAGTTG AGCCACAGGC CTACACAGAG CCTCACAGTA ACACCCGGTA CAGCCCCCCA 1980 

CCCCCATACA CCAGCCACAG CTTCGCCCAC GAAACCCATA TCACTATGCA GTCCACCGTT 2040 

CAGCTCCGCA CAGAGTATGA CCCTCACACG CACGTGTACT ACACCACCGC CGAGCCACGC 2100 

TCTGAGATCT CTGTACAGCC TGTTACCGTC ACCCAGGACA ACCTCAGCTG TCAGAGTCCC 2160 

GAGAGCACCA GCTCTACCAG GGACCTGCTC TCCCAGTTCT CAGACTCCAG CCTCCACTGC 2220 

CTCGAGCCCC CCTGCACCAA GTGGACACTC TCTTCGTTTG CAGAGAAGCA CTATGCTCCT 2280 

TTCCTCCTGA AACCCAAAGC CAAGGTTGTG GTAATCCTTC TTTTCCTGGG CTTGCTGGGG 2340 

GTCAGCCTTT ATGGGACCAC CCGAGTGAGA GACGGGCTGG ACCTCACGGA CATTGTTCCC 2400 

CGGGAAACCA GAGAATATGA CTTCATAGCT GCCCAGTTCA AGTACTTCTC TTTCTACAAC 2460 

ATGTATATAG TCACCCAGAA AGCAGACTAC CCGAATATCC AGCACCTACT TTACGACCTT 2520 

CATAAGAGTT TCAGCAATGT GAAGTATGTC ATGCTGGAGG AGAACAAGCA ACTTCCCCAA 2580

ATGTGGCTGC ACTACTTTAG AGACTGGCTT CAAGGACTTC AGGATGCATT TGACAGTGAC 2640 

TGGGAAACTG GGAGGATCAT GCCAAACAAT TATAAAAATG GATCAGATGA CGGGGTCCTC 2700 

GCTTACAAAC TCCTGGTGCA GACTGGCAGC CGAGACAAGC CCATCGACAT TAGTCAGTTG 2760 

ACTAAACAGC GTCTGGTAGA CGCAGATGGC ATCATTAATC CGAGCGCTTT CTACATCTAC 2820 

CTGACCGCTT GGGTCAGCAA CGACCCTGTA GCTTACGCTG CCTCCCAGGC CAACATCCGG 2880

CCTCACCGGC CGGAGTGGGT CCATGACAAA GCCGACTACA TGCCaIBIc CAGGCTGAGA 2940

ATCCCAGCAG CAGAGCCCAT CGAGTACGCT CAGTTCCCTT TCTACCTCAA CGGCCTACGA 3000

GACACCTCAG ACTTTGTGGA AGCCATAGAA AAAGTGAGAG TCATCTGTAA CAACTATACG 3060

AGCCTGGGAC TGTCCAGCTA CCCCAATGGC TACCCCTTCC TGTTCTGGGA GCAATACATC 3120

AGCCTGCGCC ACTGGCTGCT GCTATCCATC AGCGTGGTGC TGGCCTGCAC GTTTCTAGTG 3180

TGCGCAGTCT TCCTCCTGAA CCCCTGGACG GCCGGGATCA TTGTCATGGT CCTGGCTCTG 3240

ATGACCGTTG AGCTCTTTGG CATGATGGGC CTCATTGGGA TCAAGCTGAG TGCTGTGCCT 3300

GTGGTCATCC TGATTGCATC TGTTGGCATC GGAGTGGAGT TCACCGTCCA CGTGGCTTTG 3360

GCCTTTCTGA CAGCCATTGG GGACAAGAAC CACAGGGCTA TGCTCGCTCT GGAACACATC 3420

TTTGCTCCCG TTCTGGACGG TGCTGTGTCC ACTCTGCTGG GTGTACTGAT GCTTGCAGGG 3480

TCCGAATTTG ATTTCATTGT CAGATACTTC TTTGCCGTCC TGGCCATTCT CACCGTCTTG 3540

GGGGTTCTCA ATGGACTGGT TCTGCTGCCT GTCCTCTTAT CCTTCTTTGG ACCGTGTCCT 3600

GAGGTGTCTC CAGCCAATGG CCTAAACCGA CTGCCCACTC CTTCGCCTGA GCCGCCTCCA 3660

AGTGTCGTCC GGTTTGCCGT GCCTCCTGGT CACACGAACA ATGGGTCTGA TTCCTCCGAC 3720

TCGGAGTACA GCTCTCAGAC CACGGTGTCT GGCATCAGTG AGGAGCTCAG GCAATACGAA 3780

GCACAGCAGG GTGCCGGAGG CCCTGCCCAC CAAGTGATTG TGGAAGCCAC AGAAAACCCT 3840

GTCTTTGCCC GGTCCACTGT GGTCCATCCG GACTCCAGAC ATCAGCCTCC CTTGACCCCT 3900

CGGCAACAGC CCCACCTGGA CTCTGGCTCC TTGTCCCCTG GACGGCAAGG CCAGCAGCCT 3960

CGAAGGGATC CCCCTAGAGA AGGCTTGCGG CCACCCCCCT ACAGACCGCG CAGAGACGCT 4020

TTTGAAATTT CTACTGAAGG GCATTCTGGC CCTAGCAATA GGGACCGCTC AGGGCCCCGT 4080

GGGGCCCGTT CTCACAACCC TCGGAACCCA ACGTCCACCG CCATGGGCAG CTCTGTGCCC 4140

AGCTACTGCC AGCCCATCAC CACTGTGACG GCTTCTGCTT CGGTGACTGT TGCTGTGCAT 4200

CCCCCGCCTG GACCTGGGCG CAACCCCCGA GGGGGGCCCT GTCCAGGCTA TGAGAGCTAC 4260

CCTGAGACTG ATCACGGGGT ATTTGAGGAT CCTCATGTGC CTTTTCATGT CAGGTGTGAG 4320

AGGAGGGACT CAAAGGTGGA GGTCATAGAG CTACAGGACG TGGAATGTGA GGAGAGGCCG 4380

TGGGGGAGCA GCTCCAACTG AGGGTAATTA AAATCTGAAG CAAAGAGGCC AAAGATTGGA 4440

AAGCCCCGCC CCCACCTCTT TCCAGAACTG CTTGAAGAGA ACTGCTTGGA ATTATGGGAA 4500

GGCAGTTCAT TGTTACTGTA ACTGATTGTA TTATTKKGTG AAATATTTCT ATAAATATTT 4560

AARAGGTGTA CACATGTAAT ATACATGGAA ATGCTGTACA GTCTATTTCC TGGGGCCTCT 4620

CCACTCCTGC CCCAGA(^^ GGAGACCACA GGGGCCCTTT CCCCTG CATTGGTCTC 4680

TGTGCCACAA CCAAGCTTAA CTTAGTTTTA AAAAAAATCT CCCAGCATAT GTCGCTGCTG 4740

CTTAAATATT GTATAATTTA CTTGTATAAT TCTATGCAAA TATTGCTTAT GTAATAGGAT 4800

TATTTGTAAA GGTTTCTGTT TAAAATATTT TAAATTTGCA TATCACAACC CTGTGGTAGG 4860

ATGAATTGTT ACTGTTAACT TTTGAACACG CTATGCGTGG TAATTGTTTA ACGAGCAGAC 4920

ATGAAGAAAA CAGGTTAATC CCAGTGGCTT CTCTAGGGGT AGTTGTATAT GGTTCGCATG 4980

GGTGGATGTG TGTGTGCATG TGACTTTCCA ATGTACTGTA TTGTGGTTTG TTGTTGTTGT 5040

TGCTGTTGTT GTTCATTTTG GTGTTTTTGG TTGCTTTGTA TGATCTTAGC TCTGGCCTAG 5100

GTGGGCTGGG AAGGTCCACG TCTTTTTCTG TCGTGATGCT GGTGGAAAGG TGACCCCAAT 5160

CATCTGTCCT ATTCTCTGGG ACTATTC 5187

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1434 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Ala Ser Ala Gly Asn Ala Ala Gly Ala Leu Gly Arg Gln Ala Gly
1 5 10 15

Gly Gly Arg Arg Arg Arg Thr Gly Gly Pro His Arg Ala Ala Pro Asp
20 25 30

Arg Asp Tyr Leu His Arg Pro Ser Tyr Cys Asp Ala Ala Phe Ala Leu
35 40 45

Glu Gln Ile Ser Lys Gly Lys Ala Thr Gly Arg Lys Ala Pro Leu Trp
50 55 60

Leu Arg Ala Lys Phe Gln Arg Leu Leu Phe Lys Leu Gly Cys Tyr Ile
65 70 75 80

Gln Lys Asn Cys Gly Lys Phe Leu Val Val Gly Leu Leu Ile Phe Gly
85 90 95

Ala Phe Ala Val Gly Leu Lys Ala Ala Asn Leu Glu Thr Asn Val Glu
100 105 110

Glu Leu Trp Val Glu Val Gly Gly Arg Val Ser Arg Glu Leu Asn Tyr
115 120 125

Thr Arg Gln Lys Ile Gly Glu Glu Ala Met Phe Asn Pro Gln Leu Met
130 135 140

Ile Gln Thr Pro Lys Glu Glu Gly Ala Asn Val Leu Thr Thr Glu Ala
145 150 155 160



Leu Leu Gln His Leu Asp Ser Ala Leu Gln Ala Ser Arg Val His Val
165 170 175

Tyr Met Tyr Asn Arg Gln Trp Lys Leu Glu His Leu Cys Tyr Lys Ser
180 185 190

Gly Glu Leu Ile Thr Glu Thr Gly Tyr Met Asp Gln Ile Ile Glu Tyr
195 200 205

Leu Tyr Pro Cys Leu Ile Ile Thr Pro Leu Asp Cys Phe Trp Glu Gly
210 215 220

Ala Lys Leu Gln Ser Gly Thr Ala Tyr Leu Leu Gly Lys Pro Pro Leu
225 230 235 240

Arg Trp Thr Asn Phe Asp Pro Leu Glu Phe Leu Glu Glu Leu Lys Lys
245 250 255

Ile Asn Tyr Gln Val Asp Ser Trp Glu Glu Met Leu Asn Lys Ala Glu
260 265 270

Val Gly His Gly Tyr Met Asp Arg Pro Cys Leu Asn Pro Ala Asp Pro
275 280 285

Asp Cys Pro Ala Thr Ala Pro Asn Lys Asn Ser Thr Lys Pro Leu Asp
290 295 300

Val Ala Leu Val Leu Asn Gly Gly Cys Gln Gly Leu Ser Arg Lys Tyr
305 310 315 320

Met His Trp Gln Glu Glu Leu Ile Val Gly Gly Thr Val Lys Asn Ala
325 330 335

Thr Gly Lys Leu Val Ser Ala His Ala Leu Gln Thr Met Phe Gln Leu
340 345 350

Met Thr Pro Lys Gln Met Tyr Glu His Phe Arg Gly Tyr Asp Tyr Val
355 360 365

Ser His Ile Asn Trp Asn Glu Asp Arg Ala Ala Ala Ile Leu Glu Ala
370 375 380

Trp Gln Arg Thr Tyr Val Glu Val Val His Gln Ser Val Ala Pro Asn
385 390 395 400

Ser Thr Gln Lys Val Leu Pro Phe Thr Thr Thr Thr Leu Asp Asp Ile
405 410 415

Leu Lys Ser Phe Ser Asp Val Ser Val Ile Arg Val Ala Ser Gly Tyr
420 425 430

Leu Leu Met Leu Ala Tyr Ala Cys Leu Thr Met Leu Arg Trp Asp Cys
435 440 445

Ser Lys Ser Gln Gly Ala Val Gly Leu Ala Gly Val Leu Leu Val Ala
450 455 460

Leu Ser Val Ala Ala Gly Leu Gly Leu Cys Ser Leu Ile Gly Ile Ser
465 470 475 480

Phe Asn Ala Ala Thr Thr Gln Val Leu Pro Phe Leu Ala Leu Gly Val
485 490 495

Gly Val Asp Asp Val Phe Leu Leu Ala His Ala Phe Ser Glu Thr Gly
500 505 510

Gln Asn Lys Arg Ile Pro Phe Glu Asp Arg Thr Gly Glu Cys Leu Lys
515 520 525

Arg Thr Gly Ala Ser Val Ala Leu Thr Ser Ile Ser Asn Val Thr Ala
530 535 540

Phe Phe Met Ala Ala Leu Ile Pro Ile Pro Ala Leu Arg Ala Phe Ser
545 550 555 560

Leu Gln Ala Ala Val Val Val Val Phe Asn Phe Ala Met Val Leu Leu
565 570 575

Ile Phe Pro Ala Ile Leu Ser Met Asp Leu Tyr Arg Arg Glu Asp Arg
580 585 590

Arg Leu Asp Ile Phe Cys Cys Phe Thr Ser Pro Cys Val Ser Arg Val
595 600 605

Ile Gln Val Glu Pro Gln Ala Tyr Thr Glu Pro His Ser Asn Thr Arg
610 615 620

Tyr Ser Pro Pro Pro Pro Tyr Thr Ser His Ser Phe Ala His Glu Thr
625 630 635 640

His Ile Thr Met Gln Ser Thr Val Gln Leu Arg Thr Glu Tyr Asp Pro
645 650 655

His Thr His Val Tyr Tyr Thr Thr Ala Glu Pro Arg Ser Glu Ile Ser
660 665 670

Val Gln Pro Val Thr Val Thr Gln Asp Asn Leu Ser Cys Gln Ser Pro
675 680 685

Glu Ser Thr Ser Ser Thr Arg Asp Leu Leu Ser Gln Phe Ser Asp Ser
690 695 700

Ser Leu His Cys Leu Glu Pro Pro Cys Thr Lys Trp Thr Leu Ser Ser
705 710 715 720

Phe Ala Glu Lys His Tyr Ala Pro Phe Leu Leu Lys Pro Lys Ala Lys
725 730 735

Val Val Val Ile Leu Leu Phe Leu Gly Leu Leu Gly Val Ser Leu Tyr
740 745 750

Gly Thr Thr Arg Val Arg Asp Gly Leu Asp Leu Thr Asp Ile Val Pro
755 760 765

Arg Glu Thr Arg Glu Tyr Asp Phe Ile Ala Ala Gln Phe Lys Tyr Phe
770 775 780

Ser Phe Tyr Asn Met Tyr Ile Val Thr Gln Lys Ala Asp Tyr Pro Asn
785 790 795 800

Ile Gln His Leu Leu Tyr Asp Leu His Lys Ser Phe Ser Asn Val Lys
805 810 815

Tyr Val Met Leu Glu Glu Asn Lys Gln Leu Pro Gln Met Trp Leu His
820 825 830

Tyr Phe Arg Asp Trp Leu Gln Gly Leu Gln Asp Ala Phe Asp Ser Asp
835 840 845

Trp Glu Thr Gly Arg Ile Met Pro Asn Asn Tyr Lys Asn Gly Ser Asp
850 855 860

Asp Gly Val Leu Ala Tyr Lys Leu Leu Val Gln Thr Gly Ser Arg Asp
865 870 875 880

Lys Pro Ile Asp Ile Ser Gln Leu Thr Lys Gln Arg Leu Val Asp Ala
885 890 895

Asp Gly Ile Ile Asn Pro Ser Ala Phe Tyr Ile Tyr Leu Thr Ala Trp
900 905 910

Val Ser Asn Asp Pro Val Ala Tyr Ala Ala Ser Gln Ala Asn Ile Arg
915 920 925

Pro His Arg Pro Glu Trp Val His Asp Lys Ala Asp Tyr Met Pro Glu
930 935 940

Thr Arg Leu Arg Ile Pro Ala Ala Glu Pro Ile Glu Tyr Ala Gln Phe
945 950 955 960

Pro Phe Tyr Leu Asn Gly Leu Arg Asp Thr Ser Asp Phe Val Glu Ala
965 970 975

Ile Glu Lys Val Arg Val Ile Cys Asn Asn Tyr Thr Ser Leu Gly Leu
980 985 990

Ser Ser Tyr Pro Asn Gly Tyr Pro Phe Leu Phe Trp Glu Gln Tyr Ile
995 1000 1005

Ser Leu Arg His Trp Leu Leu Leu Ser Ile Ser Val Val Leu Ala Cys
1010 1015 1020

Thr Phe Leu Val Cys Ala Val Phe Leu Leu Asn Pro Trp Thr Ala Gly
1025 1030 1035 1040

Ile Ile Val Met Val Leu Ala Leu Met Thr Val Glu Leu Phe Gly Met
1045 1050 1055

Met Gly Leu Ile Gly Ile Lys Leu Ser Ala Val Pro Val Val Ile Leu
1060 1065 1070

Ile Ala Ser Val Gly Ile Gly Val Glu Phe Thr Val His Val Ala Leu
1075 1080 1085

Ala Phe Leu Thr Ala Ile Gly Asp Lys Asn His Arg Ala Met Leu Ala
1090 1095 1100

Leu Glu His Met Phe Ala Pro Val Leu Asp Gly Ala Val Ser Thr Leu
1105 1110 1115 1120

Leu Gly Val Leu Met Leu Ala Gly Ser Glu Phe Asp Phe Ile Val Arg
1125 1130 1135

Tyr Phe Phe Ala Val Leu Ala Ile Leu Thr Val Leu Gly Val Leu Asn
1140 1145 1150

Gly Leu Val Leu Leu Pro Val Leu Leu Ser Phe Phe Gly Pro Cys Pro
1155 1160 1165

Glu Val Ser Pro Ala Asn Gly Leu Asn Arg Leu Pro Thr Pro Ser Pro
1170 1175 1180

Glu Pro Pro Pro Ser Val Val Arg Phe Ala Val Pro Pro Gly His Thr
1185 1190 1195 1200

Asn Asn Gly Ser Asp Ser Ser Asp Ser Glu Tyr Ser Ser Gln Thr Thr
1205 1210 1215

Val Ser Gly Ile Ser Glu Glu Leu Arg Gln Tyr Glu Ala Gln Gln Gly
1220 1225 1230

Ala Gly Gly Pro Ala His Gln Val Ile Val Glu Ala Thr Glu Asn Pro
1235 1240 1245

Val Phe Ala Arg Ser Thr Val Val His Pro Asp Ser Arg His Gln Pro
1250 1255 1260

Pro Leu Thr Pro Arg Gln Gln Pro His Leu Asp Ser Gly Ser Leu Ser
1265 1270 1275 1280

Pro Gly Arg Gln Gly Gln Gln Pro Arg Arg Asp Pro Pro Arg Glu Gly
1285 1290 1295

Leu Arg Pro Pro Pro Tyr Arg Pro Arg Arg Asp Ala Phe Glu Ile Ser
1300 1305 1310

Thr Glu Gly His Ser Gly Pro Ser Asn Arg Asp Arg Ser Gly Pro Arg
1315 1320 1325

Gly Ala Arg Ser His Asn Pro Arg Asn Pro Thr Ser Thr Ala Met Gly
1330 1335 1340

Ser Ser Val Pro Ser Tyr Cys Gln Pro Ile Thr Thr Val Thr Ala Ser
1345 1350 1355 1360

Ala Ser Val Thr Val Ala Val His Pro Pro Pro Gly Pro Gly Arg Asn
1365 1370 1375

Pro Arg Gly Gly Pro Cys Pro Gly Tyr Glu Ser Tyr Pro Glu Thr Asp
1380 1385 1390

His Gly Val Phe Glu Asp Pro His Val Pro Phe His Val Arg Cys Glu
1395 1400 1405

Arg Arg Asp Ser Lys Val Glu Val Ile Glu Leu Gln Asp Val Glu Cys
1410 1415 1420

Glu Glu Arg Pro Trp Gly Ser Ser Ser Asn
1425 1430

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 




(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Leu Ile Val Gly Gly
1 5

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Pro Phe Phe Trp Glu Gln Tyr
1 5

(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGACGAATTC AARGTNCAYC ARYTNTGG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
GGACGAATTC CYTCCCARAA RCANTC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GGACGAATTC YTNGANT^J^TYTCGGA 27

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

CATACCAGCC AAGCTTGTCN GGCCARTGCA T 31

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5288 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
GAATTCCGCG GACCCCAAGG AGTGCCGCGG AAGCGCCCGA AGGACAGGCT CGCTCCGCGC 60

GCCGGCTCTC GCTCTTCCGC GAACTGGATG TGGGCAGCGG CGGCCGCAGA GACCTCGGGA 120

CCCCCGCGCA ATGTGGCAAT GGAAGGCGCA GGGTCTGACT CCCCGGCAGC GGCCGCGGCC 180

GCAGCGGCAG CAGCGCCCGC CGTGTGAGCA GCAGCAGCGG CTGGTCTGTC AACCGGAGCC 240

CGAGCCCGAG CAGCCTGCGG CCAGCAGCGT CCTCGCAAGC CGAGCGCCCA GGCGCGCCAG 300

GAGCCCGCAG CAGCGGCAGC AGCGCGCCGG GCCGCCCGGG AAGCCTCCGT CCCCGCGGCG 360

CCGGCGGCGG CGGCGGCGGC AACATGCCCT CGGCTGGTAA CGCCGCCGAG CCCCAGGACC 420

CCGCCGGCGG CGGCAGCGGC TGTATCGGTG CCCCGGGACG GCCGGCTCCA GGCGGGAGGC 480

GCAGACGGAC GGCGCGGCTG CGCCGTGCTG CCGCGCCGGA CCGGGACTAT CTGCACCGCC 540

CCAGCTACTG CGACGCCGCC TTCGCTCTGC AGCAGATTTC CAAGGGCAAG GCTACTGGCC 600

CGAAAGCGCC ACTGTGGCTG AGAGCGAAGT TTCAGAGACT CTTATTTAAA CTGGGTTGTT 660

ACATTCAAAA AAACTGCGGC AAGTTCTTGG TTGTGGGCCT CCTCATATTT GGGGCCTTCG 720
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OAOCACACT «<=,CCT=« „««„AT» CTCOC«C« OATT««« ««CT»«T 
TT«ICCT« «CT«KATA «OACCCCTA «0«OA;V« TOCTAATCTC C«ACCAC«C 
««0=CT=CT »C«„CCT» 0»CTC»C»C TC„=OC«= C««TC«T <=TATA«T=T 
A=««0«« ^^^^^^ ^^^^^^ ^^^^^^ 

«OCTTA„T «««T„C TTTACCCTTO TTTOATTATT *«CCTTT« 

AO«CTX=TO «MO«C« .«„A«CT CT«0»«OC «ACCT==T. «X««CCTC 
<=TTT«C«T= «C«*CTTC «CCCTTT« «XTCCT«. ^r^„ 
»T«A»XO=A CCC^^ ^^^^^ ^^^^^^ 

ACCOCCCCTO CCTCAATC« COCOATCCAO AC«CCCCCC CACACCCCCC AACAAAAATT 
CAAOCAAACC ,„TCATAT= «CCCTTOTTT TCAAXOCXCO ATOTCATCOC TTATCCACAA 
AOTA.ATOCA CXOCCAC.AC CAOTTCAT« X«CTCOCAC AOrCAACAAC ACCACTCOAA 
AACTCC.CAO COCCCATOCO CTOCACACCA TOTTCCACTT AAT<=ACTCCC AACCAAATC. 
ACOAOCACTT CAACOOOTAC CACTAT^CT CACACATCAA CTCOAACCA. CACAAAOCCC 
CAOCCATCCT OCAOCCCTOO CAOACOACAT ATOTCCAOOT OCTTCATCAO ACTCT^CAC 
"OAACTCCAC TCAAAAOCTC CTTTCCTTCA COACCACCAC CCTCOACCAC ATCCTCAAAT 
CCTTCTCTCA C0TCA3T.XC ATCCCTCO CCAOCCOOTA CTTACTCAXC CTCCCCTATO 
CCTCTCTAAC CATCCTCC== T«OACT=CX CCAA.TCCCA 0=OrCCCCT= .OCCXCCCTO 
^COTCCTOCT O.TTCCACTC TCAOTOOCTO CA=OACTO== CCTCTCCTCA „CATC«=AA 
TTTCCTTTAA C=C«CAACA ACTCACOXTT .OCCATTTCT C=CXCTTO„ 
ATOATCT.„ TCTTCTOOCC CACOCCTXCA OTOAAACA^C ACACAATAAA ACAATCCCTT 
TTOA«ACA= «CC«COAO TCCCTOAACC OCACA=OA== CAOC3TOCCC CTCACCTOCA 
TCA=CAATOT CACACCCTTC TTCATO.CCO CCTTAAtecO AATTCCCCCT CTOCOCC.T 
TCTCCCTCCA <=<K:A<=C=CTA 0TA=T0<™=T TCAATTTT.C CATOOTTCTC CTCATTTTTC 
CTCCAATTCT CACCAT«AT TTATATCOAC G<:CAO«AC.O OACACTO«T ATTTTCTCCT 
OTTTTACAAC CCCCTOCOTC AOCACACTCA „CA«XXO. ACCTCACCCC TACACCOACA 
CACACOACAA TACCCOCTAC ACCCCCCCAC CTCOCTACC CAOCCACACC TTTOCCCATC 
A«CCCA«T TACCATOCAC TCCACTOTCC ACCTCC=CAC OCACTACCAC CCCCACA«C 
AC^TACTA CACCACCCOX CAOCCOCOOT CCCAOATCTC «=TCCA=CCC CTCACCOTOA 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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cc^cx^c „.„e„cc cccc^e .c«=ccccc cx=™ «o.„cxc. 

ro^^ ^^^^^ ^^^^^^ 

-Tc^ccx ^^^^^ ^^^^^^^^ 

.0=«CT«. CCTX,C«.c ,™=..cOTC OOC««cC.C 

«=««C« ^.CXTXTC ^^^^^^^^ 

««r«C„ «„»cc.,c »c»cc„^ „oT«ca« 

— ^c^. ^^ ^^^^^ ^^^^ 

««»CX.« „„CTC»CT «.«.cCCO CC«.c«„ 

^^r^ „„«.c=„ ^.cxccxo c„,«„cx ccxoc.c„ .ccccc«cc 
c«««.cc „x«.c..c ««„c™. ^^^^^^^ 
-T.«.cc „,ec r.^^. 

co^c cccc^ccc «„.cc..c 

cc=»c„c„ ^c.^„ ^^^^^^^ ^^^^^^^^^^ 

««ccc.„ cx.ccxc«c c«„„«co .cccco. c„„...,. 
««T«„,c c„c.c«=c «c„..cc. o.™c ccc^cccc 

-CCCTXCCT C..c^,„ ^^^^^^^^^ ^^^^^^ ^^^^^ 

occx=o.„ ^^^^^^^ 
ccoc^xc. ^^^^^^ ^^^^^^^^^^^ 

T«.c=c«. c„.„«„ ,«.e„c„ o..coc„„ 

o»=.«=«„ „™ c„.cx^ 

cc^eccc „o„c.ro. „o„ccc.. cc..„„, 
cTcxocooc «.oc.,„ ^^^^^^ ^^^^^^^ 

c.<.„cc.c «c„.cc^ .cc„c^ 
«™c „.c„x«. o«,„ec^ 

xccccccc c.cccc.=« cccccccc ccc.oc.cco cx.cccc.xc cccccccocc 

.«CCC,„C CCOCXCXO.X XCCXCCC.CX CCO.OX.X.O XXCCC.C.CC ,„cxcx«c 
-CXC.OCO. CC.CCXXCCC C.CX.CC.OC CCC.CC.OCC CCCCCO.OCC ccxoccc.cc 

-XC.XCOX oc^cccc. c.«.ccccc xcxxccccc. cxcccxoxo cxccxcccc 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 . 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
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--c, ^^^^ 

— «cc«^ 

«-c«e J— - ~c„__ _ 

— — „ec ^^^^ 

,„c^ ^ "'^"^^ c««.«,, ^^^^ 

c„._ - jcc«„, ^^^^ 

— 0«C=C«.. 17"="^ 

„ r^"" 
:r:r ^^^"'-^ - 

«C«0CXTC <=«TTC<=CCT C«TOO«„ 

«„c„.. :rr 

JJ""" — - ^^^rrr. 

— — ^^.c..„ 

CCGGAATT ^GTTTAA TGAGCAGATA TGAAGAAAGC 



4620 
4680 
4740 
4800 



5040 
5100 
5160 
5220 
5280

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1447 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

1" *=n Al, «a oi„ p ' 

<=ly S.r Gly cys He Glj, , " 

■an 



30 

56 



Arg 

W 40 ^5 



Tyr Leu Hia Arg Pro Ser 



50 



o Asp Arg Asp 



cys Asp Ala Ala Phe Ala Leu gIu g1„ 



60 



^X. se. 

80 

«a p.. ^„ ^„ 

95 

Asn Cys Gly Lys phe Leu Val Val r<. r 

100 ^ ""^^ Phe Gly Ala Phe 

■^"^ 110 

vax ^„ 

125 



Trp Val ^ 



135 



140 



Tyr Thr Arg 



Cln Lys He Gly gIu Glu Ala Met Phe Asn Pro Gin 



150 



155 



Leu Met 11© Gin 

160 
P.O ... 

" 190 

Tyr Asn Arg Gin Trp Lys Leu Glu 



195 



200 



His Leu Cys Tyr Lys 



r.eu lie Thr Glu Thr Gly Tyr Met Asp Gin XI 



205 



Ser Gly Glu 



215 



e lie Glu Tyr Leu Tvr 
220 

- .eu „e ^. 

240 

- .e. ^^^^^ ^^^^ 

- ... ^„ 

270 

cx„ v_._. .e. 



275 



285 



T,. «« ^, 

300 

Pro Ala Thr Ala Pro Asn Lys Asn Ser Th^ t « 

305 3j^j y« ser Thr Lys Pro Leu Asp Met Ala 

320 

Leu Val Leu Asn Gly Gly Cvs Hi« r-i r 

325 ^ ^ '''^ ^^'^ I-y- Tyr Met His 

335 

Trp Gin Glu Glu Leu He Val Gly Gly Thr Val Lv, . 

/ «xy inr Val Lys Asn Ser Thr Gly 



345 



350 



Lys Leu Val Ser Ala His Ala Leu Gin Thr Met Ph^ ri« r 

355 ^eu Met Thr 



365 



Pro .^s ox„ „« ty. ol„ Hi. I.,, cl, ryr al„ X,. v.l se. Hi, 

380 

He Asn Trp Asn Glu Asp Lys Ala Ala ai^ ti« r 

385 Ala Trp Gin 



395 



Arg Thr Tyr Val Glu Val Val His Gin Ser 



405 



410 



Val Ala Gin Asn Ser 
415 



400 



Thr 



Oln val s„ .h. Thr Thr Thr Ihr «p Aop II. 



430 



ser Phe Ser Asp Val Ser Val He Arg Val Ala Ser 



440 



Gly Tyr Leu Leu 
445 



H=t ryr Ma Cy. Jeu Thr Het ^, Trp «p c^s .a. 

"^^^ 460 
ser Gl„ Ol, val Oly !..„ «a «y Val Leu Val 

^'^^ 480 
val Ala Ala 01. J.„ oi, cy. Se. ci, 



485 

Ala Ala Thr Thr Gin 
500 



490 



495 



Val Leu Pro Phe Leu Ala Leu Gly Val Gly 



505 



Val 



510 



Asp Asp Val Phe Leu Leu Ala His Al 



515 



520 



a Phe Ser Glu Thr Gly Gin Asn 



525 



t-y^ AT, II, p„ p,, 

540 

Oly Ala ser Val Ala Leu Thr Ser He Ser Asn Val Thr Ala Phe 



555 



Phe 
560 



«et Al. Al. .e. II. P„ „e P.o Ala Ar, Ala Phe s.r ol„ 

Ala Ala V.1 val Val v.l Ph. A.„ Pha Ala „.t V.l p,. 



585 



590 



Pro Ala He Leu 
595 



ser Met Asp Leu Tyr Arg Arg Glu Asp Arg Arg 
600 ^ 



Leu 



605 



Asp lie Phe cys Cys Phe Thr Ser Pro Cys Val 



615 



Ser Arg Val He Gin 



620 



val Olu Pro cm Ala Tyr Thr Asp Thr His Asp Asn Thr Arg Tyr 



630 

Pro Pro Pro Pro Tyr Ser 
645 



635 



Ser 
640 



Ser His Ser Phe Ala His 
650 



Glu Thr Gin He 
655 






ijl^frhr Val Gin I^u Arg Thr Glu Tj^^j 



Thr Met Gin 

„g .^nj, jj^y Ti^»p Pro His Thr 

665 670 
His Val Tyr Tyr Thr Thr Ala Glu Pro Arg Ser Glu Ile Ser Val Gln
675 680 685

Pro Val Thr Val Thr Gln Asp Thr Leu Ser Cys Gln Ser Pro Glu Ser
690 695 700

Thr Ser Ser Thr Arg Asp Leu Leu Ser Gln Phe Ser Asp Ser Ser Leu
705 710 715 720

His Cys Leu Glu Pro Pro Cys Thr Lys Trp Thr Leu Ser Ser Phe Ala
725 730 735

Glu Lys His Tyr Ala Pro Phe Leu Leu Lys Pro Lys Ala Lys Val Val
740 745 750

Val Ile Phe Leu Phe Leu Gly Leu Leu Gly Val Ser Leu Tyr Gly Thr
755 760 765

Thr Arg Val Arg Asp Gly Leu Asp Leu Thr Asp Ile Val Pro Arg Glu
770 775 780

Thr Arg Glu Tyr Asp Phe Ile Ala Ala Gln Phe Lys Tyr Phe Ser Phe
785 790 795 800

Tyr Asn Met Tyr Ile Val Thr Gln Lys Ala Asp Tyr Pro Asn Ile Gln
805 810 815

His Leu Leu Tyr Asp Leu His Arg Ser Phe Ser Asn Val Lys Tyr Val
820 825 830

Met Leu Glu Glu Asn Lys Gln Leu Pro Lys Met Trp Leu His Tyr Phe
835 840 845



Arg Asp Trp Leu Gin Gly Leu Gin Asn aIa pho hc.^ o 

850 occ ^ Trp Glu 

860 

Thr Gly Lys Ile Met Pro Asn Asn Tyr Lys Asn Gly Ser Asp Asp Gly
865 870 875 880

Val Leu Ala Tyr Lys Leu Leu Val Gln Thr Gly Ser Arg Asp Lys Pro
885 890 895

Ile Asp Ile Ser Gln Leu Thr Lys Gln Arg Leu Val Asp Ala Asp Gly
900 905 910

Ile Ile Asn Pro Ser Ala Phe Tyr Ile Tyr Leu Thr Ala Trp Val Ser
915 920 925

AS„ A.P p„ v,l «. iy, „ 3„ 

940 

Arg Pro Glu Trp Val His Asp Lys Ala Asp Tyr Met Pro Glu Thr Arg
945 950 955 960

Leu Arg Ile Pro Ala Ala Glu Pro Ile Glu Tyr Ala Gln Phe Pro Phe
965 970 975

Tyr Leu Asn Gly Leu Arg Asp Thr Ser Asp Phe Val Glu Ala Ile Glu
980 985 990

Lys Val Arg Thr Ile Cys Ser Asn Tyr Thr Ser Leu Gly Leu Ser Ser
995 1000 1005

fnfn'^^" <51n Tyr He Gly Leu 

1015 1020 

Arg His Trp Leu Leu Leu Phe Ile Ser Val Val Leu Ala Cys Thr Phe
1025 1030 1035 1040

Leu Val Cys Ala Val Phe Leu Leu Asn Pro Trp Thr Ala Gly Ile Ile
1045 1050 1055

Val Met Val Leu Ala Leu Met Thr Val Glu Leu Phe Gly Met Met Gly
1060 1065 1070

Leu Ile Gly Ile Lys Leu Ser Ala Val Pro Val Val Ile Leu Ile Ala
1075 1080 1085

Ser Val Gly Ile Gly Val Glu Phe Thr Val His Val Ala Leu Ala Phe
1090 1095 1100

Leu Thr Ala Ile Gly Asp Lys Asn Arg Arg Ala Val Leu Ala Leu Glu
1105 1110 1115 1120

His Met Phe Ala Pro Val Leu Asp Gly Ala Val Ser Thr Leu Leu Gly
1125 1130 1135

Val Leu Met Leu Ala Gly Ser Glu Phe Asp Phe Ile Val Arg Tyr Phe
1140 1145 1150

Phe Ala Val Leu Ala Ile Leu Thr Ile Leu Gly Val Leu Asn Gly Leu
1155 1160 1165

Val Leu Leu Pro Val Leu Leu Ser Phe Phe Gly Pro Tyr Pro Glu Val
1170 1175 1180

^^1/''° Arg Leu Pro Thr Pro Ser Pro Glu Pro 

1190 1195 1200 

Pro Pro Ser Val Val Arg Phe Ala Met Pro Pro Gly His Thr His Ser
1205 1210 1215

Gly Ser Asp Ser Ser Asp Ser Glu Tyr Ser Ser Gln Thr Thr Val Ser
1220 1225 1230

Gly Leu Ser Glu Glu Leu Arg His Tyr Glu Ala Gln Gln Gly Ala Gly
1235 1240 1245

Gly Pro Ala His Gln Val Ile Val Glu Ala Thr Glu Asn Pro Val Phe
1250 1255 1260

Ala His Ser Thr Val Val His Pro Glu Ser Arg His His Pro Pro Ser
1265 1270 1275 1280



Asn Pro Arg ^^in Pro Hia Leu Asp Ser Gly Se^Bu Pro Pro civ 
^1285 1290 1295 

Arg Gln Gly Gln Gln Pro Arg Arg Asp Pro Pro Arg Glu Gly Leu Trp
1300 1305 1310

Pro Pro Leu Tyr Arg Pro Arg Arg Asp Ala Phe Glu Ile Ser Thr Glu
1315 1320 1325

Gly His Ser Gly Pro Ser Asn Arg Ala Arg Trp Gly Pro Arg Gly Ala
1330 1335 1340

Arg Ser His Asn Pro Arg Asn Pro Ala Ser Thr Ala Met Gly Ser Ser
1345 1350 1355 1360

Val Pro Gly Tyr Cys Gln Pro Ile Thr Thr Val Thr Ala Ser Ala Ser
1365 1370 1375

Val Thr Val Ala Val His Pro Pro Pro Val Pro Gly Pro Gly Arg Asn
1380 1385 1390

Pro Arg Gly Gly Leu Cys Pro Gly Tyr Pro Glu Thr Asp His Gly Leu
1395 1400 1405

Phe Glu Asp Pro His Val Pro Phe His Val Arg Cys Glu Arg Arg Asp
1410 1415 1420

Ser Lys Val Glu Val Ile Glu Leu Gln Asp Val Glu Cys Glu Glu Arg
1425 1430 1435 1440

Pro Arg Gly Ser Ser Ser Asn
1445



